Sound Texture Synthesis Informed by Statistics
نویسندگان
چکیده
This internship report deals with the synthesis of sound textures based on a statistical description. The synthesis model under investigation was proposed by Josh McDermott and Eero Simoncelli in 2011 (McDermott & Simoncelli, 2011). First the definition of what is a sound texture will be discussed. This is followed by a description of model under investigation, insights on the theoretical aspects of the model and on its implementation will be given. The limitations of the algorithm will be discussed and related to the theoretical definition of sound textures. Some proposition to possibly improve the implementation are made. Finally some preliminary work on identifying links between higher-level parameters, such as density, and low-level statistics will be presented. Ce rapport de stage traite de la synthèse des textures sonores basée sur une description statistique du son. Le modèle de synthèsé etudié est le modèle introduit par Josh McDermott et Eero Simoncelli en 2011 (McDermott & Si-moncelli, 2011). Après avoir essayé de définir plus précisément en quoi consiste exactement une texture sonore, nous détaillerons le fonctionnement du modèle sur le plan théorique. Une description détaillée de l'implémentation será egalement présentée. Des propositions pour améliorer l'implémentation seront présentées. Les limites des possibilités de synthèse du modèle seront discutées et mis en relation avec la conception théorique que nous avons des textures sonores. Finalement un travail préliminaire portant sur l'identification de liens entre des paramètres de haut-niveau, tels que la densité d'une texture sonore, et l'encodage statistique de bas-niveau sera présenté.
منابع مشابه
Synthesis of Sound Textures with Tonal Components Using Summary Statistics and All-pole Residual Modeling
The synthesis of sound textures, such as flowing water, crackling fire, an applauding crowd, is impeded by the lack of a quantitative definition. McDermott and Simoncelli proposed a perceptual source-filter model using summary statistics to create compelling synthesis results for non-tonal sound textures. However, the proposed method does not work well with tonal components. Comparing the resid...
متن کاملSound Texture Perception via Statistics of the Auditory Periphery: Evidence from Sound Synthesis
Rainstorms, insect swarms, and galloping horses produce "sound textures"--the collective result of many similar acoustic events. Sound textures are distinguished by temporal homogeneity, suggesting they could be recognized with time-averaged statistics. To test this hypothesis, we processed real-world textures with an auditory model containing filters tuned for sound frequencies and their modul...
متن کاملSound Texture Synthesis with Hidden Markov Tree Models in the Wavelet Domain
In this paper we describe a new parametric model for synthesizing environmental sound textures, such as running water, rain, and fire. Sound texture analysis is cast in the framework of wavelet decomposition and multiresolution statistical models, that have previously found application in image texture analysis and synthesis. We stochastically sample from a model that exploits sparsity of wavel...
متن کاملState of the Art in Sound Texture Synthesis
The synthesis of sound textures, such as rain, wind, or crowds, is an important application for cinema, multimedia creation, games and installations. However, despite the clearly defined requirements of naturalness and flexibility, no automatic method has yet found widespread use. After clarifying the definition, terminology, and usages of sound texture synthesis, we will give an overview of th...
متن کاملDescriptor-based Sound Texture Sampling
Existing methods for sound texture synthesis are often concerned with the extension of a given recording, while keeping its overall properties and avoiding artefacts. However, they generally lack controllability of the resulting sound texture. After a review and classification of existing approaches, we propose two methods of statistical modeling of the audio descriptors of texture recordings u...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013